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ABSTRACT i 

The  problem  (attributed  to  R.  M.  Karp  by  Knuth  (  see  #36  of  Ill))  is  to 
describe  thd  sequences  of  minimum  length  which  contain,  as  subsequences,  all  the 
permutations  of  an  alphabet  of  n  symbols.  This  paper  catalogs  some  of  the  easy 
observations  on  the  problem  and  proves  that  the  minimum  lengths  for  n-5,  n-6  & 
n-7  are  19,  28  and  39  respectively.  Also  presented  is  a  construction  which 
yields  (for  n>2)  many  appropriate  sequences  of  length  n2-2n+4  so  giving  an 
upper  bound  on  length  of  minimum  strings  which  matches  exactly  all  known  values. 
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Security  Clarification 


1  NOTATION. 


a)  Let  S  be  a  sequence  of  symbols.  |S|  will  be  used  to  denote  the  total 
number  of  symbols  in  S  and  so  we  observe,  for  example,  |x  y  x  z|  ■  4. 

b)  Ue  say  xcy  in  the  case  where  x  is  a  subsequence  of  y  and  we  say  "x  is 
equivalent  to  y"  if  x  can  be  obtained  from  y  by  a  simple  change  of 
alphabets  we  denote  this  equivalence  by  Y. 

(e.g.  xycxyyx,  xyzxsl231) 


c)  P ( A)  is  used  to  denote  the  set  of  sequences  which  are  permutations  of 

an  alphabet  A.  Cardinality  of  P (A)  will  be  (|A|)!.  Also,  P'(A,n)  is 
is  the  set  of  permutations  of  all  sub-alphabets  of  A'  of  size  n  (  where 
n  <  | A |  ) .  Clearly,  P(A)«P'(A,  j A] ) . 


d)  If  A  is  an  alphabet  then  CHAM  x  |  xtA'  a  Vy .  (ycP (A)  o  ycx)}  where  A' 

is  the  set  of  sequences  over  alphabet  A.  For  example  abcacba  e  Q(abc). 
Also,  Q'(A,n)  is  taken  to  be  the  set  {  x|xcA'  a  Vy.  (ycP (A, n)  o  ycx)}. 
So,  for  example,  zyxwxyz  t  Q'(wxyz,2)  . 


e)  Now,  the  LENGTHS  of  the  shortest  sequences  in  Q (A)  and  Q'(A,n)  depend 

only  on  the  SIZE  of  the  alphabet  A.  Hence,  take  M (n)  to  be  the  length 

of  the  shortest  sequence  in  Q (1  2  3...n)  and  I1'(n,m)  to  be  the  length 

of  the  shortest  sequence  In  Q'll  2  3  ...  n,  n)  . 

So,  for  example,  MUM,  H (2) -3  and  H'(n,l)-n  . 

f)  S(n)  denotes  the  n-th  symbol  of  sequence  S. 

S(n:m)  denotes  that  contiguous  subsequence  of  sequence  S  which  is  the 
symbols  from  position  number  n  in  S  to  position  number  m. 

AHS.x)  denotes  the  number  of  ocurrencec  of  the  symbol  x  in  sequence  S. 

g)  "CPAF  X"  is  just  an  abbreviation  for  "Consider  the  Permutations  of  the 
current  Alphabet  of  the  Form  X"  ,  The  greek  letters  which  appear  in  X, 
denote  arbitrary  sequences  of  symbols. 

For  example,  if  the  alphabet  under  discussion  were  abcde,  the  command 
"CPAF  bac"  would  mean  "Consider  Permutations  of  abcde  which  start  with 
b  and  end  with  c". 
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2  SOME  EASY  OBSERVATIONS. 


2.1  nu)-i. 


2.2  M(2)-3. 


2.3  f1(3)“7. 


2.4  fl'  iii,  a ) -n. 


2.o  fl'  (n,2)«(2n-l)  can  be  seen  as  follows: 


f1'(n,2)  i  2n-l  since  if  A  is  an  alphabet  of  length  n,  then  the 
sequence  AA(2:2n)  is  a  member  of  Q'(A,2). 

f1'(n.2)  2  2n-l  since  If  A  is  an  alphabet  of  size  n,  S  Is  a  member  of 
Q'(A,2)  and  |S|<2n-l  then  at  least  two  of  the  symbols  of  A  (x  and  y, 
say)  only  appear  once  in  S;  hence  1  of  the  sequences  'xy'  and  'yx' 
are  not  subsequences  of  S. 


2.6  H' (n, m) 2 (m. (2n-m+l ) /2)  (n2M,  of  course) 


This  result  is  more  easily  remembered  as 

fl'  (n,m)  2  n  +  n-1  +  n-2  + .  +  n-m+1  . 

Suppose  A  is  an  alphabet  of  size  n  and  S  is  a  sequence  from  Q'(A,m) 
of  minimum  length  (i.e.  |S|«fl'(n,m)  ).  It  is  noted  in  (2.4)  that 
M'  (n,l)»n  so  take  m>2.  Segment  S  as  Txll  where  the  sequences  T.U  and 
the  symbol  x  are  chosen  so  that  x  does  not  appear  in  T  but  all  the 
other  symbols  of  A  do.  Clearly,  |  T 1 2 (n-1) .  Now  note  that  a  I  I 
permutations  of  subalphabets  of  A  of  size  m  which  start  with  x  are 
subsequences  of  xU.  Hence  all  permutations  of  subalphabets  of  A\x 
of  size  (m-1)  are  subsequences  of  U  (  A\x  is  A  without  x  and 
|A\x|-(n-l)  ).  | U |  2  fl' (n-1, m-1),  therefore,  and  so  M'(n,m)  (which 

is  simply  |S|)  is  at  least  (n-1)  +  1  +  fl' (n-1, m-1) .  Th  is  recurrence 
relation  is  readily  solvod  to  give  the  reeult. 


2.7  f1(n)2(n.  (n+l)/2). 


Simple  corollary  of  2.6  using  H(n)-f1' (n,n) . 


M'  (n,m)<(m. (n-l)+l) 

Given  an  alphabet,  A,  of  size  n  ,  the  following  construction  gives  an 
element  of  Q'(A,m)  of  length  m*(n-l)+l  s- 

Awe>a!o^1Per!!,U!ati0ns  0f  the  alPhabet  Al,  A2,  A3,  ...  Am  such  that 
• 1  •  J  ’  A?(n,"A2(1}.  etc-  Now-  B  -  Al  A2(2:n)  A3(2:n) . .  .Am(2:n) 

is  in  Q  (A, mi  since  if  C  is  any  permutation  of  any  subalphabet  of  A 
of  size,  m,  C ( j )  is  either  in  the  J-th  component  of  B  or  IS  the  last 
symbol  of  the  (j-l)th  component  (for  j>l). 


f1(n)<(n.  n-n+1 ) 


A  simple  corol lary  01  2.8. 


M'  (n,3)-(3n-2) 


(n*3) . 


From  2.G  we  get  M' (n,3)>(3n-3) . 

From  2.8  we  get  fl'  (n,3)<(3n-2) . 

Suppose  the  lower  value  is  obtained  for  an  alphabet  A  ( | A I -n)  and  S 
is  a  sequence  of  length  3n-3  which  is  in  Q'(n,3).  Now  no  symbol  can 

T'S,  ,,°?,Cer,  S  ,or  ,hen  M  “ould  have 

| | £ (2. n (n-1 , 2)  +1) - (4n-5)  which  is  a  contradiction  for  n*3.  Hence 
there  must  be  at  least  3  symbols  which  occur  just  2  times  each  for  a 
total  of  G  times.  However  M (3) -7  so  there  must  be  some  permutation  of 
these  three  eymbole  which  ie  not  a  subsequence  of  S.  This 
contradiction  gives  us  the  result. 
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Members  of  Q(1  2  3)  of  Length  7. 

The  following  is  an  exhaustive  I ist  of  minimum  solutions  for  a  3 
symbol  alphabet.  Ue  consider,  of  course,  only  equivalence  classes 
(with  respect  to  the  operator  ■  ). 


12  3  12  13 
1232123 
12  13  12  1 


1  2  3  1  2  3  1 
1232132 
1213212 


1  2  3  1  3  2  1 


VScQ(A) .  3acA.  #(S,a)*|A|. 

Use  induction  on  the  alphabet  size.  The  case  |A|-1  is  trivial  so 

SU^8n,^e  r®8ult  h0lci8  for  aM  a,Phabet8  of  size  less  than  n,  |A |-n 
and  ScQ(A),  Segment  S  as  TxU  where  sequences  T,U  and  symbol  x  are 
chosen  so  that  x  does  not  appear  in  T  but  every  other  symbol  of  A 
does.  Use  A\x  to  denote  A  minus  symbol  x,  and  we  get  UtQ(A\x).  Now 

|A\x|  -  n-l  and  so  we  can  find  y  such  that  #(U,y)*(n-l) .  Clear lu 
ff(b,y)2n.  ° 
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2.13  VSeQ'(A,m).  Card({  a  |  atA  a  #(S,a)>m  })  >  (n-m+1) 


Let  A  be  any  alphabet,  m  be  any  integer  such  that  |A|*m  and  S  be  some 
member  of  Q'(A,m).  Select  sequence  B  -  a  permutation  of  A  such  tnat 
the  symbols  of  B  are  in  order  of  decreasing  frequency  in  S. 

Now  take  sequence  S'  to  be  the  sequence  formed  by  deleting  those 
symbols  from  S  which  are  in  B  (1 :  n-m) .  S'  is  a  member  of 
Q (B (n-m+1 s n) )  and  so  some  symbol  must  appear  at  least  m  times  in  S' 
and  hence  in  S. 

Therefore,  #(S,B(1))  fc#(S,B(2))  >.  .....fc  #(S,B(n-m+l))  z  m  which 

gives  the  quoted  result. 


2.14  M' (n.m)  >  m(n-m)+H(m) 


A  corol lary  of  2.13  . 


2.  IS  M  (4) -12. 


Take  A  to  be  the  alphabet  (sequence)  1234  . 
123412314213  €  Q (A)  and  so  11(4)512  . 


Suppose  S  c  Q (A)  and  ] S ] <12. 

Compute  the  least  integer  j  such  that  S (1*  j)  contains  each  symbol 
of  A.  Note  ji4  and  S ( j )  is  not  in  S (1: j-1). 

Considering  permutations  of  A  which  start  with  S(j),  we  get  that 
|S|  >  3  +  #(S,S(j))  +  11(3)  -  10  +  #(S,S(j))  . 

Using  | S | <12  we  get  j-4  and  #(S,S( j) )=i. 

Thereforo,  S (4)  appears  only  at  position  4  of  S.  Now  consider  the 
permutations  of  A  that  end  with  S (4)  and  get  that  4^M (3)  which 
i  s  a  contradiction. 

From  this  contradiction  we  see  that  M(4)*12. 


2.16  VA.  VxeA.  3SeQ (A ) .  #{ S.xNl 


Suppose  we  are  given  an  alphabet  A  and  x  is  some  symbol  of  A.  Ue 
take  the  subalphabet  A\x  and  find  some  member  T  from  Q(A/x). 
Clearly  TxT  c  Q(A)  and  also  #(TxT,x)-l. 

This  is  quite  a  useful  result  to  keep  in  mind  when  pondering  what 
properties  members  of  Q (A)  might  have. 
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IKS) -19. 


Takt  A  to  be  the  alphabet  (sequence)  12345  . 


i)  1234512341523145213  t  Q  (A) 
so  we  have  11  (5)  <19  . 


ii)  Suppose  ScQ (A)  and  J S | <19  . 

Break  up  S  as  T  y  U  (where  T  and  U  are  segments  of  S  and 
y  is  a  single  symbol)  such  that  Ty  is  the  shortest  initial 
segment  of  S  which  is  in  Q'(A,2)  so  |Ty|>f1'  (S,2)-9. 
Choose  x  in  T  such  that  xy  is  not  a  subsequence  of  T  (this 
is  possible  otherwise  S  was  not  segmented  as  prescribed). 

Considering  members  of  P (A)  starting  with  xy,  get 

|S|  *  9  +  11(3)  +  #(U,x)  +  #(U,y)  -  IB  +  #(U,x)  +  #(U,y). 

Now,  supposing  x  does  not  appear  in  U,  consider  subsequenres 
of  S  that  end  with  x  and  derive  the  contradiction 
| S | >M (4) +2+M (3) -21 . 

Conclude  #(U,x)2l  (and  similarly  #UJ,y)£l), 

Reconciling  inequalities,  we  get  #(U,x)-l,  #(U,y)-l,  |T|-8, 
|U j-9  and  |S|-18. 

In  U,  x  and  y  appear  just  once  each  and  so  one  sequence  of 
xy  and  yx  ,  call  it  Z,  is  not  a  subsequence  of  U. 

Consider,  then,  permutations  of  A  of  the  form  ccZ  and  get 

| T |  2  11(3)  +  #(T,x)  +  #(T,y)  >9  —  a  contradiction! 

He  therefore  conclude  that  11(5)219. 


iii)  From  i)  and  ii)  deduce  II (5) -19. 
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M (6) -28 


and  h(7)-39. 

■  ■■ 


i)  Take  A  to  be  the  alphabet  (sequence)  1  2  3  4  5  G  . 

1234561234516234156231456213 

is  in  Q(A)  so  ws  have  H(6)528  . 

The  proof  of  11(6)228  is  given  as  Appendix  1  because  it  is 
long  and  uninformative. 

These  tuo  facts  give  the  result  f1(6)-28. 

ii)  Take  A  to  be  the  alphabet  1  2  3  4  5  6  7. 


1  2 
is 


34567123456172345 

1672341E672314567 
in  Q(A)  so  ue  have  (1(7)  i  39  . 


2  1 


3 


11(7)239  (■  proved  as  appendix  2  )  and  so  ue  have  M (7) -33. 
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Minimum  Length  Solutions  for  Alphabets  of  Size  4. 


Let  A  be  the  alphabet  abed. 

He  wish  to  enumerate  the  equivalence  classes  in  Q(A) 
of  the  minimum  length  (ie  12).  Suppose  ScQ(A)  and  |S|«12. 

Lemma:  VptA.  #(S,p)*2 

pc  A  a  #(S,p)=0  is  absurd.  ...  ,,  „ 

Suppose  PiA  a  #(S,p)-l  Ue  have  that  S  has  the  orm  UpV. 
CPAF  op  to  get  |U|*t1(3)-7  ;  CPAF  pa  to  get  |V|>IH3)-7  . 
He  immediately  have  the  contradiction  |S|  -  |UpV|  ;>  lb. 

Lemma:  3p.  #(S,p)=2  .  0  .  . 

Suppose  not.  In  view  of  above  lemma,  VpcA.  #(S,p)£3  which 

is  a  violation  of  the  result  2.12  {page  3). 


Supposing  #(S,p)«2,  choose  T.U.V  such  that  S  -  TpUpV. 

CPAF  pa  to  get  |UV J ^7;  CPAF  ap  to  get  [TUI *7. 

Now  | U |  -  |U|+(|S|-12)  -  (|U|+|T|+|U|+|V|+2)-12  £  4. 

Also  | T |  -  | S | -2—  |U | - 1 V |  S  3  and  similarly  |V|<;3. 

Suppose  IT  I <3.  Thus  3xeA.  -(xeT)A-(x-p) . 

CPAF  xpa  to  give  |V|*M2i+#(V,x)-3+#(V,x) .  So  #(V,x)-0. 

CPAF  apx  to  give  the  contradiction  | T | (2) «3- 
Hencs  | T | -3  and  similarly  |V|-3  giving  |U|-4. 

Suppose  qcA  and  -(q-p)  a  #(T,q)-0. 

CPAF  qpa  to  get  #(V,q)«0.  Hence  by  a  lemma  above,  #(U,q)^Z. 
CPAF  qap  to  get  the  contradiction  |U|*t1(2)+#(U,q) *5. 

Hence  Vq.  qtA  3  (q-p  v  #(T,q)-#(V,q)-l) . 

From  this  discussion  we  get  that  there  are  representatives  of 
al  I  the  equivalence  classes  of  the  form 

a  b  c  d  U  d  V  where  |U j -4,  | V | -3,  atV,  b(V,  ccV. 

CPAF  ad  we  get  abcU  is  in  Q(a  b  c)  and  is  of  min.  length. 
Using  result  (2.11)  we  get  5  possibilities  for  U;  namely: 

(1)  abac  (2)  abca  (3)  aeba  (4)  babe  (5)  bacb. 

Similarly  UV  is  in  Q (a  b  c)  and  is  of  minimum  length. 
Performing  a  small  amount  of  hand  checking  and  using  2.11 
again  we  get  that  there  are  exactly  9  equivalence  classes:- 


abed  abca  dbac 
abed  abca  dbca 
abed  abca  deba 


abed  aeba  dbca 
abed  aeba  dcab 
abed  aeba  deba 


abed  bacb  dabc 
abed  bacb  dacb 
abed  bacb  dcab 
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An  nz  -2n  +4  Construction  for  Alphabet  of  size  n. 
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Given  an  alphabet  sequence,  A,  of  length  at  least  three,  it 
is  asserted  that  the  following  recipe  gives  a  sequence  in  Q(A). 


Set  the  sequence  variable  B  «■  A (2: n) ; 


Wri te (A) ; 

DO  (n-2)  TIMES  {  Urite(A(l));  Write(  B (1 : n-2)  ); 

B  «•  (B(n-l)  B  (1 : n-2) ) ;  }; 

Write(A(l))j  Urite(B(l)); 


The  total  number  of  symbols  written  =  n+  (n-2)*(l+n-2)  +2 

«  n2-2n+4 


We  now  verify  that  the  sequence  produced  is  Indeed  in  Q ( A) . 

First  note  that  the  operation  "  B  «•  B (n-1 ) B (1 : n-2)  "  simply 
rotates  the  sequence  of  n-1  symbols  in  B. 

Next  note  that  the  first  symbol  of  A  (we  will  call  it  a)  is 
written  exactly  n  times.  Letting  C  be  the  result  of  the  above 
construction,  we  segment  C  as  follows: 

C  ■  aJaKala. . . aYaZab  where  the  (n-1)  sequences 
J,K,l,...Y,Z  do  not  contain  the  symbol  a. 

For  convenience  we  will  use  call  J,K,L, . Y,Z  units  and  will 

refer  to  them  as  UQ],  U C21 ,  ...  UIn-11. 

Now  J  contains  all  symbols  A (2: n)  but  K,l,,..Y,Z  each  contain 
just  n-2  of  the  symbols  of  A (2: n) .  However  the  symbol  of  A (2: n) 
that  does  not  appear  in  some  unit  U Ck]  is  both  the  last  symbol 
of  U tk-1 ]  and  follows  the  a  that  follows  U(kl  in  C. 

Let  P  be  a  permutation  of  A.  We  will  show  that  P  must  be  a 
subsequence  of  C. 

Suppose  a  appears  in  the  jth  position  of  P.  We  first  show  that 
the  string  P (1 : j )  (simply  a  if  j=l)  can  be  matched  to  the 
the  head  of  C  aJaKaL. . . U C j -1] a  .  Trivially  true  if  j =1 . 

If  j>l  then  P (1 )  is  in  J,  clearly.  Also  if  j>k>l  then  P(k)  can 
be  matched  to  U Ck]  if  it  is  in  that  unit  or  else  the  last 
symbol  of  U Ck-1 ] . 

Similarly  the  n-j  symbols  of  P(j+l:n)  can  be  matched  to 
U  [  j]  all  [  j+ll  a. . .  aU  Cn-13  ab  .  If  j<k<n  then  Ptkl  will  either 
match  something  in  U Ck-1 1  or  the  symbol  which  fo1 lows  the  a 
wt  ich  fol lows  U Ck-1] . 


S 


7. 


A  Nore  General  n2-2n+4  Construction. 


It  is  asserted  that  the  following  algorithm,  regardless  of  which 
internal  choices  are  made,  also  produces  a  member  of  Q (A)  of  length  n2-2n+4. 
The  proof  of  membership  in  Q (A)  follows  by  the  same  method  used  in  proving  the 
validity  of  the  simpler  'program'.  It  is  also  readily  seen  that  the  previous 
construction  is  a  special  case  of  this  more  general  one. 


SUBROUTINE  SRls 

Write  the  symbol  [x] ; 

Write  the  symbol  Iy3 ; 

SUBROUTINE  SR2: 

SRI; 

Write  in  any  order  the  In-33  symbols  of  A  which  do  not  include 
1x3  or  Iy3  or  Iz3 . 

DO  y*-z  AND  set  z  to  the  last  symbol  written. 

SUBROUTINE  SR3:  • 

DO  SR2  In-23  TINES; 

SRI; 

SUBROUTINE  SR4: 

DO  SR2  In-33  TINES; 

SRI; 

Write  in  any  order  the  In-23  symbols  of  A  which  are  not  1x3 ,  [y] ; 
Write  the  symbol  1x3; 

NAIN  ROUTINE: 

Write  down  the  alphabet  (A); 

DO  EITHER  {  x«-A(l);  y  any  symbol  of  A(2:n-1);  z«-A(n);  } 

OR  Ix«-  A (2) ;  y  Ad);  z  -  A(n);  3; 

DO  EITHER  SR3  OR  Sr!4; 


SYNBOL  COUNT. 

If  N  symbols  are  written  each  time  a  certain  routine  is  obeyed 
then  we  say  that  the  SYNBOL  COUNT  for  that  routine  is  N. 

Symbol  Count  for  SRI  -  2  ; 

Symool  Count  for  SR2  ~  n-1  ; 

Symbol  Count  for  SR3  ■  (n-2) * (n-1 ) +2  -  n*  -3n  +4; 

Symbol  Count  for  SR4  =  (n-3)*(n-l)+(n+l)  ■  nB  -3n  +4. 

Hence  Symbol  Count  for  total  algorithm  -  n2  -  2n  +  4  . 


Note  that  no  distinct  sequences  produced  by  this  algorithm  are 
equivalent  since  all  such  begin  with  a  copy  of  the  alphabet. 


Note  also  that  every  sequence  so  produced  ends  with  some  permutation 
of  the  alphabet. 


Given  an  alphabet  A,  the  reversal  of  any  sequence  which  is  a  member  of 
q ( A)  is  also  a  member  of  Q(A) .  it  should  be  noted  that  the  the  reverse  of  any 
sequence  generated  according  to  this  construction  is  equivalent  to  some  other 
sequence  given  by  the  construction. 
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8. 


Constructing  Elements  of  L'(A,m). 
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Saction  6  contained  a  simple  construction  for  generating  elements  of 
q ( A )  (for  given  alphabet  A  of  size  n>2)  which  were  of  length  n2-2n+4  .  This 
algorithm  is  now  modified  to  generate  members  of  QMA.m)  (where  2<msn)  of 
length  mn-2m+4. 


Set  the  sequence  variable  B  A (n-m+2: n) ; 
Write (A); 

DO  m-2  TIMES  Write!  A(l:n-m+l)  ); 

Wr i te  {  B (1 : m-2)  ) ; 

B  <-  B (m-l)B(l: m-2) j 
Wr i te(  Adsn-m+l)  )i 
Wr  i  te  (  B ( 1 )  ) ; 


The  total  number  of  symbols  written  is  easily  seen  to  be 
n  +  (m-2) (n-m+1  +  m-2)  +  (n-m+1)  +1  ■  mn-2m+4  . 


Just  as  this  algorithm  is  a  modification  of  the  one  in  section  G,  the 
proof  of  the  correctness  of  the  construction  is  an  extension  of  the  previous 

proof. 


This  construction  gives  an  upper  bound  on  M' (n,m)  for  n>m>2  of  mn-2m+4 
and  so  using  this  knowledge,  ;.he  proposi  t  ion  2.14  and  the  various  values  of 
n(4),  M(5) ,  M(6)  &  M(7)  we  already  know,  we  compute  the  new  results:- 

M'  (n,4)  -  4n-4 
M' (n,5)  -  5n-G 
MMn.G)  -  Gn-8 
M' (n,7)  -  7n-10 
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9. 


Discussion. 


The  construction  of  section  7  gives  many  sequences  of  the  desired 
length.  It  rives  all  nine  equivalence  classes  of  sequences  in  Q(a  bed)  of 
length  .12,  128  classes  in  Q(a  b  c  d  e)  which  may  or  may  not  be  all  of  them, 
and  32,400  classes  from  Q (a  b  c  d  a  f).  It  does  NOT  get  all  the  sequences  of 
Q(a  b  c  d  e  f)  since  all  the  ones  produced  start  with  one  copy  of  the 
alphaoet  however  the  following  sequences  from  Q (a  b  c  d  e  f): 

abcdebfdcabedcfbadecbdfacebd 

abcdeafdcbaedcfabdecafdbcead 

(among  others  known)  00  NOT!  In  fart,  the  second  of  these  examples  does  not 
even  end  with  a  permutation  of  the  alphabet. 

An  easy  to  derive  lower  bound  on  the  numter  of  classes  is  (  (n-3)  !  )t(n-l). 


He  now  tabulate  the  known  values  of  the  functions  fl  &  M'  . 


m 

fl(m) 

m2-2m+4 

IT  (n,  iii) 

1 

1 

3 

n 

2 

3 

4 

2n-l 

3 

7 

7 

3n-2 

4 

12 

12 

4n-4 

5 

19 

19 

5n-6 

G 

28 

28 

Gn-8 

7 

39 

39 

7n-10 

The 

fact  that 

;  the  actua 

1  values  of  M (n) 

exactly  match  the  n2-2n+4 

f i gure 

for 

2  <  n 

$  7  make 

tne  construction 

relatively  important.  It  also 

suggests 

the 

obvious 

conjecture 

that  M (n)  is  exactly  n2-2n+4  for  all  n>2. 

However, 

there  is  another  competing  conjecture  which  gives  exact  fit  at  n-1, 2 

as  well 

as  the  other 

known  values  of  N(n)  but  is  more  complicated:- 

H(n) 

-  n2 

for  n-1 

n2-n+l 

for  2sns3 

n2-2n+<* 

for  4Sn$7 

n2-3n+ll 

for  8$nsl5 

where  F(3'«0  a 


n*-m*n+F  On) 
F (n)»n+2*F (n-1) . 


for  2m  sns  2.2m-l 


Of  course,  knowing  whether  the  value  for  11(8)  is  51  or  52  would  help  by 
eliminating  one  of  these  postulates. 
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It  is  surprising  that  the  best  lower  bound  we  have  on  fl (n)  is  ne/2 
since  it  would  appear  that  it  is  of  order  n2.  This  conjecture  is  readily 
stated  formal ly  as:  - 

Vk.  k<l  s  3N.  n>N  :>  (fl(n)  >  k*n2) 


It  should  be  noted  that  just  the  mechanical  checking  of  the 
membership  of  a  eequence  (over  alphabet  A)  in  Q (A)  Is  quite  time-  consuming. 
A  program  is  available  in  ALGOL  but  (although  it  includes  some  means  for 
pruning  the  tree  of  permutations)  takes  a  long  time  to  check  that  ail 
permutations  of  the  alphabet  are  subsequences  of  the  given  sequence.  The 
actual  times  on  a  PDP1Z  are  3,  17  and  60  seconds  for  alphabets  of  sizes  8,  9 
8  10  respectively. 


REFERENCE: 

1.  Chvatal ,V. ,  K I arner ,0. A. ,  Knuth.D.E.,  "Selected  Combinatorial  Research 
Problems",  Report  CS  292,  Computer  Science  Department,  Stanford  University, 
June  1972. 


APPENDIX  1.  Proof  of  H(6)*28. 


Take  A  to  be  an  alphabet  of  size  6  (  |A|«6  ). 

Moreover,  suppose  ScQ(A)  and  |S|<28  . 

Now  choose  (sequences  T,  V  and  symbols  x,y  such  that 

a)  Tx  is  the  shortest  head  of  S  that  is  in  Q' (A, 2); 

b)  yV  is  the  shortest  tail  of  S  that  is  in  Q'(A,1); 

Choose  wcT  such  that  wxx  a  ->(wxcT). 

Ue  have  immediately  that  |T|£l0  ,  | V | ^5  and  from  consid¬ 
eration  of  the  el  entente  of  P  (A)  of  the  forms  wxa  &  fy 
get  |S|fc|T|+l'f1(4),  |S|*|V|+1+I1(S),  |T|sl4,  |V|s7,  |S|*25. 

Hence  we  can  segment  S  as  the  sequence  TxUyV  ar.d  note 
10s  | T |  Sl4,  2i  |U|  sslO,  5s  |V|  s7,  25  <  |S|  s27. 

Again  CPAF  wxa  and  get  | UyV | ifl (4)  +2-14.  Hence  (using  |S|S27  ) 
|T|Sl2  and  (using  |V|s7  )  |U|fc6  .  Also  CPAF  ay  again  to 
deduce  |TxU|Sf1(5)+l«20.  Therefore,  |S|  Z  20+l+|V|  Z  26  and 
(using  |T|Sl2  )  |U|>7.  Lastly  (using  | S | 527  and  |TxU|£20),  |V|£S 

Suppose  #(U,w)-0.  Since  |yV|  s  7  but  contains  all  of  A, 
there  must  be  5  symbols  of  yV  which  appear  just  once. 

Therefore  wc  choose  p,q  such  that  p,q,x,w  are  distinct, 

-•{pq  c  yV)  and  p,q  both  appear  twice  in  T.  Ue  can  do  this 
since  only  one  symbol  of  Tx  can  appear  only  once.  Now  CPAF 
orupq  to  get  |T|  Z  11(3)  +  #(T,w)  +  #(T,p)  +  #(T,q)  z  12. 

So  | T |-12  and  #(T,w)-l.  Segment  S  as  LwMxUyV  noting  that  since 
LwMx  is  in  P (A, 2)  and  #(L,w)»0,  |H|£4.  This  gives  that  |L|s7 
and  #U1xU,w)u0  .  fl (5, 2) -9  so  we  pick  p,q  such  that  -’(pq  c  L) 
and  p,q,u  distinct.  Now  CPAF  pqwa  to  get  |yV|£M(3)+#(yV,w)fc8. 

Th i g  contradiction  glvaa  #(U,u)il  . 

Again  CPAF  wxa  and  get  |.jV|  z  M(4)+#(UyV,w)+#(yV,x)  *15. 

Use  |S|s27  to  get  |T|sll  and  use  | V | sB  to  get  |U|^8. 

Now  let  teA  be  such  that  #(U,t)-0.  As  above  we  choose  p,q 
so  that  t,p,q  are  distinct,  -’(pq  c  yV)  and  p,q  both  appear 
at  least  twice  in  T.  CPAF  rctpq  to  deduce  the  contradiction 
| Tk |  S  M (3)  +  #(Tx,t)  +  #(Tx,p)  +  M Tx,q)  Z  12  !! 

Hence  ail  symbols  appear  at  least  once  in  U. 

Yet  again  CPAF  wxa  to  get  |UyV|  z  M(4)+#(UyV)+#(UyV)  z  IB. 

As  before  deduce  |T|Sl0  and  |U|s9  .  Also  CPAF  ay  to 
give  |Tx!'|  Z  M(5)+#(TxU,y)  Z  21  and  then  | S | -27,  |V|-5 
Ue  also  have  |T|-10,  |U|-10  and  Vt.  tcA  o  tcU. 

The  proof  is  concluded  by  deriving  contradictions  in  the 
various  possible  cases  of  equality  among  u,x,y 


CASE  1. 


x»y.  and  eo  S  ■  TxUxV, 

Ue  know  #(T,x)*l  and  #(U,x)il  so  CPAF  ax  and  get  the 
contradiction  21  ■  |TxU|  i  11(5)  +  #(TxU,x)  z  22  . 


CASE  2a.  u*y  (i.e.  w,x,y  all  distinct). 

CPAF  wxciy  to  get  |U|  >  t1(3)+#(U,w)+#(U,x)+#(Uty)  *  10 
Therefore  #(U,w)  *  #(U,x)  -  #(U,y)  -  1. 

Now  this  gives  that  one  of  wx  or  xw,  call  it  Z,  is  such  that 
M2  c  U).  CPAF  aZy  and  get  |T|  *  H(3)+#(T,w)+#(T,x)+#(T,y) 
But  #(T,u)+#(T, y)  Z  3  and  so  |T|  i  11  --  contradiction!! 


CASE  2b.  u«y. 

Find  the  first  symbol  of  V  which  is  not  x  ;  call  it  z. 
Note  that  since  yVcP(A)  a  |yVf-|A|,  z  appears  just  once  in  V. 
CPAF  yxaz  to  deduce  |U|  i  H(3)+#(ll,y)+#(U,x)+#(U,z)  t  10. 
Immediately  we  see  #(U,x)  -  #(U,z)  -  1  and  so  one  of  xz.zx 
(  call  it  Z  )  is  not  a  subsequence  of  1). 

CPAF  ocZy  to  get  |T|  >  f1{3)+#(T,x)+#(T,y)+#(T,z) . 

Use  #(T,y)-t#(T,z)*3  for  the  contradiction  |T|  >  11. 


APPENDIX  2.  Proof  of  fl (7) 2:39. 

■  ■Eiei/i.i  nm  am  mmm  m  a  mmammamu 

Take  A  to  be  an  alphabet  of  size  7  (  |A|-7  ). 

Moreover,  suppose  ScQ(A)  and  | S | <39  . 


Choose  sequences  T,U,U  and  symbols  a,b,c  such  that 

a)  Ta  is  the  shortest  head  of  S  that  is  in  Q' (A, 1 ) 

b)  cU  is  the  shortest  tail  of  S  that  is  in  Q' (A, 1 ) 

c)  TaUb  is  the  shortest  head  of  S  that  is  in  QMA,2) 

Ue  segment  S  as  TaUbVcU  and  readily  proves 

6<|T|<8,  5<|U|<3,  8< ! V | <18 ,  6S|U|<8,  36s|S|*38t 
as  well  as  |T|+|U|sl5. 

Suppose  for  some  p  in  A,  #(V,p)»0. 

If  o  is  the  symbol  b,  fl'  (6,3)+#(Tallb,p)  i  18  >  | T aUb |  so  we 
can  choose  q,r,s  such  that  v’istinct(p,q,r,e)  a  -'(qrscTaUb) 
so  that  -’(qrsp  c  TaUbV).  CPAF  qrspa  we  get  ?  contradiction 

|CY|24+nn). 

Other  wise  p,b  are  distinct  and  11'  (5,3)+#(TaU)  >  17  i  | T aU |  so 
we  rechoose  q,r,s  such  that  di st inct (p,q,r , s)  a  --(qrscTaU) 
which  means  ->(qrsp  c  TaUbV).  As  before  get  a  contradiction. 
Lemma  1:  VxeA.  #(V,x)s 1  follows  from  these  contradictions. 

Suppose  pcA  a  d i st i net (a, p) .  Ue  know  #(T,p)*l  and  #(Ub,p)£l 
and  #(V,p)£l  and  #(cU,p)*l  so  conclude  #(G,p)*4.  Also  we 
have  #(V,a)4l  and  #(cU,a)*l  so  that  #<S,a)i3. 

Ue  sharpen  our  inequalities  now.  CPAF  aa  to  get  |T|S7,  |S|:»37? 
CPAF  aba  to  get  |T|  +  |U|Sl3;  CPAF  ab  to  get  |U|s7.  Hence 
Gs | T  |S7,  5s|U|s7,  13s|V|s18,  6s|U|s7,  37s|S|<38. 

Suppose, in  fact,  # (S, a) -3. 

Ue  re-segment  S  as  TaJaKaL  where  #{TJKL,a)-0  and  LcU. 

There  is  at  most  one  repeated  symbol  in  T  since  |Ta|S|A|+l. 

Let  z  denote  this  symbol  if  it  exists  else  any  symbol  of  T. 
Choose  p,q  such  that  distinct(p,q,a,z)  a  ->{pq  c  T) . 

CPAF  pqzaa  to  deduce  that  some  subsequence  G  of  KaL  belongs 
to  Q(A1)  where  A1  is  obtained  from  A  by  deleting  p,q,a,z. 
|G|*M(3)-7  so  some  symbol  of  G  appears  at  least  3  times. 

So  we  choose  y  to  be  such  a  symbol  and  note 
distinct(a.y)  a  #(T,y)-l  a  #(KaL,y)*3. 

Now  one  of  py  and  yp  (call  it  Z)  is  not  a  subsequence  of  T. 
CPAF  Zzaa  to  shou  ue  can  choose  x  with  the  properties 
di  st inct (x,y,a)  a  #(T,x)«1  a  #(KaL)£3. 


New,  one  of  the  sequences  xy  and  yx  is  not  a  subsequence 
of  T  (cal!  it  Y)  and  CFAP  Yaa  to  get 

|KaL|  >  11(4)  +  #(KaL,a)  +  #(KaL,x)  +  #(KaL,y)  2  19. 

By  symmetry  | T aJ | >19  to  give  the  contradiction  | S | >19+19+1 . 

Lemma  2*  VxeA.  #(5,x)>4  is  immediate. 

Again  CPAF  aa  to  get  |T|«6,  |S|«38,  #(S,a)«4; 

Also  CPAF  ac  to  derive  |U|-6,  |U|+|V|»23,  #(S,c)«4. 

Then  CPAF  aba  to  get  | VcU |  2  l1(5)+#,VcU,a)+<((Vcl-l,b)  2  23 
which  leads  to  1S£|V|s18  anti  5s|U|s7. 

Suppose  that  p,q  are  such  that  MpqcY).  Ue  have  that 
# (TaUb, p)+# (TaUb, q)  23  .  Now  | TaUb | <15  and  so 

| T aUb  |  <  IT  (5,3)+#(TaUb,p)+#(TaUb,q)  .  Hence  we 
choose  j,k,l  such  that  distinct(j,k,  !,p,q)  a -<(jk!  c  TaUb). 

CPAF  jklpqa  so  |cU|  2  M(2)+5  «8  >  |cU|  —  3  contradiction! 
Thus  VptA.  VqcA.  #(V,p)+#(V,q)23. 

In  particular,  letting  z  bo  the  first  symbol  of  cU  which  is  not 
one  of  -  a,b  ,  #(V,a)+#(V,b)+#(V,z)  2  5. 

CPAF  acaz  to  get  |Y|  2  M(4)+#(V,a)+#(V,b)+#(V,z)  2  17 
Thus  we  have  new  bounds  for  U, V:  —  5s|U|s6,  !7s|V|sl8  . 

Ue  now  choose  sequence  H  and  symbol  d  such  that 
dHcU  is  the  shortest  tail  of  S  in  Q(A). 

By  symmetry  with  (ne  results  for  U  we  have  that  5$|H|<6 
and  so  we  re-segment  o  as  TallbGdHcU  where 
| T | -G,  Ss|U|s6,  10s|G|sl2,  5s |H|sS,  |U|-G,  |S|-38, 

#(S,a)=4,  tf{  S,c)=4, 

Suppose  x  is  such  that  x*a  a  x*c  a  -(ecG). 

I  f  x*b  then  CPAF  abea  to  get 

|  dHcU  |  2  N  (4)  +  (# (dHcU, a) +# (dHcU, bl)+tt (dHcU, e)  2  12+3+2 

-  a  contradiction. 

I f  xxd  then  CPAF  aedc  to  get 

| TaUb |  2  M (4) +(# (TaUb, c)+# (TaUb, d))+# (TaUb, e)  2  12+3+2 

-  a  I  so  a  contradiction. 

The  remaining  case  Is  x-b»d.  Lemma  1  (with  #(S,c)«4  )  gives 
that  #(TaUb,c)<2  and  since  there  is  at  most  one  symbol  in  TaUb 
appearing  3  times,  we  choose  p,q  (not  c  or  b)  bo  that  #(TaUb,p)s2 
and  #(TaUb,q)<2.  Since  I1(3)»7  there  is  some  permutation  Z  of 
c,p,q  that  is  not  a  subsequence  of  TaUb.  CPAF  Zba  to  get 
|  HcU  |  2  M  (3)  +#  (HcU,  b)  +# (HcU , c)  +#  (HcU, p)  +#  (HcU,  q)  2  7+1+2+2+2  -  14. 
-  a  contradiction. 

From  these  3  contradictions  we  get  (xeA  a  x*a  a  x*c)  o  #(G,x)2l. 
Now  suppose  -’(acG).  Choose  p,q,r  so  that  distinct (a, p,q,r)  and 
-’(pqr  c  dHcU).  CPAF  aapqr.  Clearly  acU  (else  |T|2f1(4)  I  and  so 
#(TaUb,a)>2.  Hence 

| TaUb |  2  H (3) +# (TaUb, a) +....+#( TaUb, r)  2  7+2+2+2+2  -  15 
From  this  contradiction  we  get  #(G,a)2l  and  by  symmetry  #(G,c)2l. 
Lemma  3:  VxcA.  #(G,x)2l  follows. 


17 


Suppose  xeA  a  xxa  a  xxc.  #(T,x)»#(U,x)»1,  #(Ub,x)£l,  #(dH,x)fcl 
and  #  'G,x)fcl  to  yield 
Lemma  4:  Vx«A.  (xxa  a  xxc)  o  #(S,x)£5. 

Suppose  distinct (a, b,c) . 

Us  first  choose  z  to  be  the  first  symbol  of  U  which  is  not  a,b. 
b*a  a  b*c  so  we  have  bcG,  b«dH  giving  #(GdH,b)>2. 

zxa  a  zxc  so  we  have  zcG,  z«dH  giving  #(GdH,z)>2. 

Also  a»<c  so  aedH  and  we  have  aeG  giving  #  (GdH,  a)  £2. 

CPAF  abaz  to  derive  |GdH|  Z  M(4)+#(GdH,a)+#(GdH,b)+#(GdH,z)  £  18. 
Ue  get  from  this  that  |U|=5  and  also  #(GdH,b)  -  2  =  #(GdH,z). 
This  then  gives  that  #(S,z)<»5  and  #(S,b)»5  . 

Let  p,q,r  be  the  3  sumbols  of  the  A  which  are  not  a,b,c,z. 

#(S,a)  +  #(S,b)  +  #(S,c)  +  #(S,z)  -  4+4+S+5  -  18 
so  #(S,p)  +  #(S,q)  +  #(S,r)  ■  20. 

Since  no  symbol  appears  twice  in  TaUb,  can  choose  a  permutation 
Z  of  pqr  so  that  -(ZcTaUb). 

CPAF  Za  to  get  2S«|GdHcW|af1(4)  +  (23-6N26  -  a  contradiction. 
Similarly  'distinct(a,d,c)'  gives  a  contradiction. 

Lemma  5:  -’distinct(a,b,c)  a  ->distinct(a,d,c). 


In  view  of  lemma  5,  two  important  cases  are  a«c  and  ->(a*c). 

CASF  1.  a-c. 

Suppose  first  that  ad).  Clearly  |U|*G  and  | TaUb| -14. 

Letting  z  be  the  first  symbol  of  U  not  a,b  CPAF  abaz  to 
get  | GdH |  2:  12+#fGdH,a)+#(GdH,b)+#(GdH,z)  *17. 

But  I GdH j -17  so  we  see  #(GdH,b)  ■  2  »  #(GdH,z). 

Thus  #(S,a)+#  (S,b)+#(S,z)«14. 

Now  choose  p,q,r,s  such  that  pqrsabz  is  a  permutation  of  A  and 
#(S,p)  *  # (S, q)  2:  #(S,r)  Z  U( S,s).  Now  since  some  symbol  appears 
at  least  7  times  in  S,  #(S,p)*7  and  #(S,q)+#(S,r)+#(S,s)Sl7. 
Hence  #( S,s)$5  and  so  #(S,p)+#(S,q)+#(S,r)>19. 

Now  each  of  p,q,r  appears  exactly  twice  in  TaUb  and  so 

i)  tt (GdHaU, p)  +# (GdHaU, q)  +# (GdHaU, r)  &  13 

ii)  since  M (3) -7  there  is  a  permutation  of  pqr 
(call  it  Z  )  such  that  -(Z  c  Talib). 

CPAF  Za  to  get  24  -|GdHaU|  *  I1(4)+13  -  25. 

This  contradiction  gives  us  #(U,a)»0. 

Again  letting  z  be  the  first  symbol  of  U  not  a.b  we  have 
#( GdH,a)*2,  #(GdH,b)*2,  #(Gdh,z)>2  so  CPAF  abaz  to 

deouce  |GdH|*18  and  hence  |U|-5  and  #(S,b)»#(S,z)»5 
Similarly,  #(S,d)=5  and  |H|»5. 

| G | -12  and  #(G, a)  »  # (G, b) -2  so  the  other  5  symbols  appear 
a  total  of  8  times  in  G.  Hence  choose  p,q  so  that  ->(pqcG) 
and  dist  inct(a,b,p,q) .  -dabpq  c  TaUbG)  so  CPAF  abpqa 
to  derive  a  contradiction  |dHaU|  Z  7  +  3*2  +1-14. 


CASE  2.  -(a-c). 

He  have  a*b  and  cud  so  Lemma  5  gives  both  o-c  and  d»c. 
Hence  S  looks  like  TaUbGaHbU  with  |T|«6,  5s|U|$6,  10S|G|Sl2, 
S<|H|s6,  |U|-6,  #(G,a)-#(G,b)-l,  #(T,b)-#(U,a)-l. 

Clearly  #(TUH,a)  0  -  #(UHU,b), 

We  can  write  the  alphabet  in  order  of  decreasing  frequency  in 
S  as  pqrstab  where  all  except  a,b  occur  at  least  5  times  and 
#(S,p)>7.  Hence,  as  p.q.r.s.t  appear  a  total  of  30  times 
#(S,t)«5  and  #(S,s)SS  and  #(S,p)+#(S,q)+#(S,r)  *  19. 


CASE  2a:  |U|-5  . 

Some  permutation,  Z,  of  pqr  will  not  be  a  subsequence  of  TaUb 
so  CPAF  Zee  to  get  | GaHbU |  *  12+19-6  -  25. 

This  gives  us  that  #( S,p)+#(S,q)+#(S,r)  -  19  and  #(S,s)-6. 
We-  then  deduce  #(S,p)*7,  #(S,q)  ■  #(S,r)  ■  6. 


Now  if  z  denotes  the  last  symbol  of  T  then  CPAF  zee  to  get 
32  -  |  allbGaHbU  |  i  M(6)  +  #(S,z)  -  1  or  #(S,z)s5 
But  z*a  so  #(3,z)25  so  we  deduce  z-t. 

Similarly  the  first  symbol  of  U  is  t. 

Recall  that  -(7  c  TaUb),  #(G,aM(G,b)-l  and  note  #(G,t)-l. 
CPAF  Zaba  to  deduce  that  ab  c  G. 

CPAF  Ztboc  to  deduce  that  tb  c  G. 

Similarly  deduce  that  at  c  G. 

i.e.  a  precedes  t  precedes  b  ( m  G) . 


Suppose  t  is  not  the  last  symbol  of  U.  We  find  y,z  such  that 
(yz t  c  TaUb)  and  so  ->(yztab  c  TaUbGaH).  CPAF  yztab  for 
the  contradiction  by  which  we  can  conclude  U(5)*t. 


We  have  that  S  has  the  form  T'taU'ftbGaHbtU'  where  T't-T, 
U'f t*U  and  tU'-U  (this  defines  T',  U',  f,  U'). 

Clearly  fua,  f**b,  f*t  and  so  #(S, f)sB. 

Now  -(tf  c  TaUb)  so  CPAF  tfaab  to  get  |G|fc7+3+#(G,  f ) . 

Suppose  #(G,f)-l.  From  #(S,f)*6  aeduce  «(H,f)-2. 

Now  one  of  tf.ft  is  not  in  G  -  call  it  Z. 

CPAF  abZa  to  get  | aHbU | *7+1+2+2+3-15  -  a  contradiction. 
Hence  we  have  #(G,f)«2  and  |G|«12  so  |H | -5. 


Now  let  the  last  symbol  of  T'  be  g  and  suppose  b*g. 

-,(gb  c  TaU)  and  -(ta  c  G)  so  -(flbta  c  TaUbG). 

CPAF  gbtaa  to  get  a  contradiction. 

Hence  the  last  symbol  of  V  is  b. 

Now  -(bf  c  T'taU')  but  we  have  -(ta  c  bG)  so  -(bfta  c  TaUbG). 
CPAF  bftaa  to  get  12  -  |Hbl4|  £  7+1+1+2+2  ■  13. 

This  last  contradiction  dispenses  with  CASE  2a. 
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CASE  2b:  |H|-5  . 

The  elimination  of  this  case  is  siir' lar  to  CASE  2a. 


CASE  2c:  |U|*5  a  |H|*5. 

Ue  have  so  far  that  b  »  TatlbGaHbW  with  |T|*|U|»|H|»|U|»G 
| G | -10,  #(G,a)-#(G,b) *1,  #(TUH,a)  -  #(UHU,b)  -  0. 

Suppose  first  that  #(S,s)»5. 

Without  I o3s  of  generality  suppose  s  precedes  t  in  G. 

-’(abts  c  TallbGa).  Moreover  if  any  p,q  or  r  precedes  s  in  H 
then  CPAF  abtsa  to  get  jHbWj >7+1 +1+4-13  -  a  contradiction. 
Hence  only  t  may  precede  s  in  H. 

Si  mi  lar  ly  only  s  may  fol  low  t  in  U. 

Non  CPAF  atasb  to  get  |Gj>M(7)+#(G,a)+#(G,b)+#(G, s)+#(G, t) -11. 
The  contradiction  serves  to  give  us  #(S,s)*5. 

Hence  #(8,s)-6  and  #(S,p)-7,  #(S,q)«#(S,r)-G. 

Letting  x  be  the  duplicated  symbol  In  U  and  y  the  duplicated 
symbol  in  H,  #(U,x)-2,  # (H, y) 

If  x-y  then  #(S,x)*7  so  x-p  and  thus  #(G,x)-l. 

One  of  yt.ty  (call  it  Z)  is  not  a  subsequence  of  G. 

CPAF  abZa  to  get  |HbU| *7+1  +1+2+3*14  -  contradiction. 

Else  if  y*p  then  #(S,y)-S  (note  y*a,  y*b,  y*t)  and  #(G,y)-l 
One  of  yt.ty  (call  it  Z)  is  not  a  subsequence  of  G. 

CPAF  abZoc  to  get  | HbU j 2;7+l+l+2+3-14  -  contradiction. 

Else  xxy  a  y-p  so  xxp  and  #(3,x)-6. 

One  of  xt.tx  (call  1 1  Z)  Is  not  a  subsequence  of  G. 

CPAF  ocZab  to  get  |  Tall  J  2:7+1+1+2+3-14  -  contradiction. 

This  trio  of  contradictions  completely  eliminates  CASE  2c. 


CASES  2a,  2b,  2c  all  provided  contradictions  as  did  CASE  1 
so  the  assumption  that  ]S|<39  is  proved  impossible. 

Q.E.D. 
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